SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANT: SHYJAN, Andrew

(ii) TITLE OF INVENTION: NOVEL MULTIDRUG RESISTANCE-ASSOCIATED
POLYPEPTIDE

(iii) NUMBER OF SEQUENCES: 8

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Testa, Hurwitz & Thibeault

(B) STREET: 125 High St.

(C) CITY: Boston

(D) STATE: MA

(E) COUNTRY:

(F) ZIP: 02110

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: WALLER, Patrick R H 

(B) REGISTRATION NUMBER: 41,418 

(C) REFERENCE/DOCKET NUMBER: MIL-001>S 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 248-7000 

(B) TELEFAX: (617) 248-7100 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4847 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 116..4426
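As a quick consistency check on the feature table, the CDS location can be converted into a codon count. The sketch below is illustrative only: the helper name `cds_summary` is hypothetical and not part of the listing, and it assumes the location is 1-based and inclusive, as is usual for this notation. Under that assumption the span 116..4426 covers 4311 nucleotides, that is 1437 codons, which agrees with the 1437 amino acids stated for SEQ ID NO:2 and suggests the stated location does not include the stop codon.

```python
def cds_summary(start: int, end: int) -> dict:
    """Summarise a CDS location given as 1-based, inclusive coordinates.

    Hypothetical helper for illustration; not part of the sequence listing.
    """
    length = end - start + 1  # inclusive span, so add one
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return {"nucleotides": length, "codons": length // 3}


# Location taken from the FEATURE entry of SEQ ID NO:1.
summary = cds_summary(116, 4426)
print(summary)  # {'nucleotides': 4311, 'codons': 1437}
```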






(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GGCTCATGCT CGCSGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC AGGGGCGCAG 60

GAATTCTGAT GTG7 ^CTAA CAGTCTGTGA GCCCTGGAAC CTCCACTCAG AGAAG ATG 118
Met
1

AAG GAT ATC GAC ATA GGA AAA GAG TAT ATC ATC CCC AGT CCT GGG TAT 166
Lys Asp Ile Asp Ile Gly Lys Glu Tyr Ile Ile Pro Ser Pro Gly Tyr
10 15

AGA AGT GTG AGG GAG AGA ACC AGC ACT TCT GGG ACG CAC AGA GAC CGT 214
Arg Ser Val Arg Glu Arg Thr Ser Thr Ser Gly Thr His Arg Asp Arg
20 25 30

GAA GAT TCC AAG TTC AGG AGA ACT CGA CCG TTG GAA TGC CAA GAT GCC 262
Glu Asp Ser Lys Phe Arg Arg Thr Arg Pro Leu Glu Cys Gln Asp Ala
35 40 45

TTG GAA ACA GCA GCC CGA GCC GAG GGC CTC TCT CTT GAT GCC TCC ATG 310
Leu Glu Thr Ala Ala Arg Ala Glu Gly Leu Ser Leu Asp Ala Ser Met
50 55 60 65

CAT TCT CAG CTC AGA ATC CTG GAT GAG GAG CAT CCC AAG GGA AAG TAC 358
His Ser Gln Leu Arg Ile Leu Asp Glu Glu His Pro Lys Gly Lys Tyr
70 75 80

CAT CAT GGC TTG AGT GCT CTG AAG CCC ATC CGG ACT ACT TGC AAA CAC 406
His His Gly Leu Ser Ala Leu Lys Pro Ile Arg Thr Thr Cys Lys His
85 90 95

CAG CAC CCA GTG GAC AAT GCT GGG CTT TTT TCC TGT ATG ACT TTT TCG 454
Gln His Pro Val Asp Asn Ala Gly Leu Phe Ser Cys Met Thr Phe Ser
100 105 110

TGG CTT TCT TCT CTG GCC CGT GTG GCC CAC AAG AAG GGG GAG CTC TCA 502
Trp Leu Ser Ser Leu Ala Arg Val Ala His Lys Lys Gly Glu Leu Ser
115 120 125

ATG GAA GAC GTG TGG TCT CTG TCC AAG CAC GAG TCT TCT GAC GTG AAC 550
Met Glu Asp Val Trp Ser Leu Ser Lys His Glu Ser Ser Asp Val Asn
130 135 140 145

TGC AGA AGA CTA GAG AGA CTG TGG CAA GAA GAG CTG AAT GMl GTT GGG 598
Cys Arg Arg Leu Glu Arg Leu Trp Gln Glu Glu Leu Asn Glu Val Gly
150 155 160

CCA GAC GCT GCT TCC CTG CGA AGG GTT GTG TGG ATC TTC TGC Cdfc ACC 646
Pro Asp Ala Ala Ser Leu Arg Arg Val Val Trp Ile Phe Cys Arg Thr
165 170 175

AGG CTC ATC CTG TCC ATC GTG TGC CTG ATG ATC ACG CAG CTG GCT 694
Arg Leu Ile Leu Ser Ile Val Cys Leu Met Ile Thr Gln Leu Ala Gly
180 185 190

TTC AGT GGA CCA GCC TTC ATG GTG AAA CAC CTC TTG GAG TAT ACC CAG 742
Phe Ser Gly Pro Ala Phe Met Val Lys His Leu Leu Glu Tyr Thr Gln
195 200 205






GCA ACAX^G TCT AAC CTG CAG TAC 
Ala Thr GlAi Ser Asn Leu Gin Tyr 
210 \ 215 



CTC CTG ACG 
Leu Leu Thr 



ATC GTG CGG TCT 
GlV lie Val Arg Ser 
s,230 



AGC TTG TTG TTA GTG CTG GGC CTC 
Ser Leu Leu Leu Val Leu Gly Leu 
220 225 

TGG TCG CTT GCA CTG ACT TGG GCA 
Trp Ser Leu Ala Leu Thr Trp Ala 
235 240 



790 



838 



TTG AAT TAC CGA AG^C GGT GTC CGC 
Leu Asn Tyr Arg Thr^ Gly Val Arg 
245 



TTG CGG GGG GCC ATC CTA ACC ATG 
Leu Arg Gly Ala lie Leu Thr Met 
250 255 



886 



GCA TTT AAG AAG ATC CTT AAG TTA 
Ala Phe Lys Lys lie Leuv Lys Leu 
260 \ 265 



AAG AAC ATT AAA GAG AAA TCC CTG 
Lys Asn lie Lys Glu Lys Ser Leu 
270 



934 



GGT GAG CTC ATC AAC ATT TGC TCC 
Gly Glu Leu lie Asn lie Cys\Ser 
275 280 



AAC GAT GGG CAG AGA ATG TTT GAG 
Asn Asp Gly Gin Arg Met Phe Glu 
285 



982 



GCA GCA GCC GTT GGC AGC CTG C*] 
Ala Ala Ala Val Gly Ser Leu Leu^ 
290 295 



GCT GGA GGA CCC GTT GTT GCC ATC 
.la Gly Gly Pro Val Val Ala lie 
300 305 



1030 



TTA GGC ATG ATT TAT AAT GTA ATT 
Leu Gly Met lie Tyr Asn Val He 
310 



ATTV CTG GGA CCA ACA GGC TTC CTG 
Ile^eu Gly Pro Thr Gly Phe Leu 
3^5 320 



1078 




GGA TCA GCT GTT TTT ATC CTC TTT 
Gly Ser Ala Val Phe He Leu Phe 
325 

CGG CTC ACA GCA TAT TTC AGG AGA 
Arg Leu Thr Ala Tyr Phe Arg Arg 
340 345 

GT GTC CAG AAG ATG AAT GAA GTT 
rg Val Gin Lys Met Asn Glu Val 
355 360 

ATG TAT GCC TGG GTC AAA GCA TTT 
Met Tyr Ala Trp Val Lys Ala Phe 
370 375 



GAG GAG GAG CGT CGG ATA TTG GAA 
Glu Glu Glu Arg Arg He Leu Glu 
390 



TAC CCA\GCA ATG ATG TTT GCA TCA 
Tyr Pro Via Met Met Phe Ala Ser 
330 \ 335 

AAA TGC GTG\GCC GCC ACG GAT GAA 
Lys Cys Val \la Ala Thr Asp Glu 
350 



CTT ACT TAC ATT^ 
Leu Thr Tyr He 
365 



TTT ATC AAA 
Phe He Lys ' 



TCT CAG AGT GTT CAGXAAA ATC CGC 
Ser Gin Ser Val Gin Ijys He Arg 
380 \ 385 

AAA GCC GGG TAC TTC CA(iyAGC ATC 
Lys Ala Gly Tyr Phe Ginger He 
395 4^0 



1126 



1174 



1222 



1270 



1318 



ACT GTG GGT GTG GCT CCC ATT GTG 
Thr Val Gly Val Ala Pro He Val 
405 



GTG GTG ATT GCC AGC GTG GTG. ACC 
Val Val He Ala Ser Val ValVhr 
410 415 



1366 



TTC TCT GTT CAT ATG ACC CTG GGC 
Phe Ser Val His Met Thr Leu Gly 
420 425 



TTC GAT CTG ACA GCA GCA CAG GCT 
Phe Asp Leu Thr Ala Ala Gin Ala\ 
430 



1414 



TTC ACA GTG GTG ACA GTC TTC AAT 
Phe Thr Val Val Thr Val Phe Asn 
435 440 



TCC ATG ACT TTT GCT TTG AAA GTA 
Ser Met Thr Phe Ala Leu Lys Val 
445 



1462 






ACA CCG TTT 
Thr Pro Phe^ 
450 



TCA GTA AAG TCC CTC TCA GAA GCC TCA GTG GCT GTT GAG 
,Ser Val Lys Ser Leu Ser Glu Ala Ser Val Ala Val Asp 
455 460 465 



1510 



AGA TTT AAG 
Arg Phe Lys 



AG^ TTG 
Ser^ Leu 
\4 70 



TTT CTA ATG GAA GAG GTT CAC ATG ATA AAG AAC 
Phe Leu Met Glu Glu Val His Met lie Lys Asn 
475 480 



1558 



AAA CCA GCC 
Lys Pro Ala 



AGT CfeT CAC ATC AAG ATA GAG ATG AAA AAT GCC ACC TTG 
Ser Pr\ His lie Lys lie Glu Met Lys Asn Ala Thr Leu 
485 \ 490 495 



1606 



GCA TGG GAC 
Ala Trp Asp 
500 



TCC TCC CAC TCC AGT ATC CAG AAC TCG CCC AAG CTG ACC 
Ser Ser Hig^ Ser Ser lie Gin Asn Ser Pro Lys Leu Thr 
505 510 



1654 



CCC AAA ATG 
Pro Lys Met 
515 



AAA AAA GAC AA<; 
Lys Lys Asp LysN 
520 



AGG GCT TCC AGG GGC AAG AAA GAG AAG 
Arg Ala Ser Arg Gly Lys Lys Glu Lys 
525 



1702 



GTG AGG CAG 
Val Arg Gin 
530 



CTG CAG CGC ACT 
Leu Gin Arg Thr 
535 



GACS CAT CAG GCG GTG CTG GCA GAG CAG 
GluNpis Gin Ala Val Leu Ala Glu Gin 
540 545 



1750 



AAA GGC CAC 
Lys Gly His 



CTC CTC CTG GAC 
Leu Leu Leu Asp 
550 



AGT GA(\ GAG CGG CCC AGT CCC GAA GAG 
Ser Asp X^lu Arg Pro Ser Pro Glu Glu 

560 



1798 



GAA GAA GGC 
Glu Glu Gly 



AAG CAC ATC CAC 
Lys His lie His 
565 



CTG GGC CAC\CTG CGC TTA CAG AGG ACA 
Leu Gly His \eu Arg Leu Gin Arg Thr 
570 \ 575 



1846 



CTG CAC 
Leu His 



1 T(^C GGC 
1/Cys Gly 
i 595 

GGC CAG 
Gly Gin 
610 



AGC 
Ser 
580 

AGT 
Ser 



ATG 
Met 



ATC GAT CTG GAG 
lie Asp Leu Glu 



ATC CAA GAG GGT> 
lie Gin Glu Gly 
585 



CTG GTT GGA ATC 
fs Leu Val Gly lie 
590 



GTG 
Val 



ACG 
Thr 



GGA 
Gly 



CTT 
Leu 



AGT 
Ser 



CTA 
Leu 
615 



GGA 
Gly 
600 

GAG 
Glu 



AAA ACC TCT CTC ATTVTCA GCC ATT TTA 
Lys Thr Ser Leu lie Xer Ala lie Leu 
605 

GGC AGC ATT GCA ATC AGT\GGA ACC TTC 
Gly Ser lie Ala lie Ser GJly Thr Phe 
620 \ 625 



1894 



1942 



1990 



GCT TAT GTG GCC CAG CAG GCC TGG ATC CTC AAT GCT ACT CTGXaGA GAC 
Ala Tyr Val Ala Gin Gin Ala Trp lie Leu Asn Ala Thr Leu TLrg Asp 
630 635 64,0 



2038 



AAC ATC CTG 
Asn lie Leu 



TTT GGG AAG GAA TAT GAT GAA GAA AGA TAG AAC TCT\GTG 
Phe Gly Lys Glu Tyr Asp Glu Glu Arg Tyr Asn Ser V^l 
645 650 655 



2086 



CTG AAC AGC TGC 
Leu Asn Ser Cys 
660 



TGC CTG AGG CCT GAC CTG GCC ATT CTT CCC AGC AGC^ 
Cys Leu Arg Pro Asp Leu Ala lie Leu Pro Ser Ser 
665 670 



2134 



GAC CTG ACG 
Asp Leu Thr 
675 



GAG ATT GGA GAG 
Glu lie Gly Glu 
680 



CGA GGA GCC AAC CTG AGC GGT GGG CAG 
Arg Gly Ala Asn Leu Ser Gly Gly Gin 
685 



2182
CGC
Arg 
690 

TAG 
Tyr 



CAC 
His 



CTG 
Leu 



ATC 
lie 



Gil 



AGG 
Arg 



ATC CTG 
lie Let 



ATC TTC 
lie Phe 



ATC AGC 
lie Ser 



GAC GAC 
Asp Asp 
710 

^T AGT 
Ser 



CTT GCC 
Leu Ala 
695 

CCC CTC 
Pro Leu 



CGG GCC 
Arg Ala 



AGT GCC 
Ser Ala 



GCT ATC 
Ala lie 



TTT GTT 
Phe Val 
740 

TTC ATG 
Phe Met 
755 



721 

ACC CAC 
Thr HiS 



AAA GAG 
Lys Glu 



CAG TTA 
Gin Leu 



TGT 
Gl\ Cys 
760 



CGG AAA 
Arg Lys 
730 

CAG TAC 
Gin Tyr 
745 

ATT ACG 
lie Thr 



TTG TAT 
Leu Tyr 
700 

TTA GAT 
Leu Asp 
715 

CAT CTC 
His Leu 



AGT GAC 
Ser Asp 



GCC CAT 
Ala His 



AAG TCC 
Lys Ser 



CTG GTT 
Leu Val 



GAA Adk 
Glu Arg 



GAC TGT 
Asp Cys 
750 

GGC ACC 
Gly Thr 
765 



AGG AGC ATC 
Arg Ser lie 
705 

GTG GGC AAC 
Val Gly Asn 
720 

AAG ACA GTT 
Lys Thr Val 
735 

GAT GAA GTG 
Asp Glu Val 



CAT GAG GAA 
His Glu Glu 



2230 



2278 



2326 



2374 



2422 



CTG 
Leu 
770 

CTG 
Leu 



ATG AAT 
Met Asn 



GGA GAG 
Gly Glu 



TTA AAT 
Leu Asn 



ACA CCG 
Thr Pro 
790 



GGT G> 
Gly 
775 

CCA GTT 
Pro Val 



TAT GCT 
Tyr Ala 



GAG ATC 
GliAlle 



ACC ATT 
Thr lie 
780 

AAT TCA 
Asn Ser 
795 



TTT AAT 
Phe Asn 



AAA AAG 
Lys Lys 



AAC 
Asn 



GAA 
Glu 



CTG TTG 
Leu Leu 
785 

ACC AGT 
Thr Ser 
800 



2470 



2518 



GGT 
Gly 



AAG 
Lys 



GAA 
Glu 



TCA CAG 
Ser Gin 



AAG GAA 
Lys Glu 
820 

GAG AAA 
Glu Lys 
835 



AAG AAG 
Lys Lys 
805 

AAA GCA 
Lys Ala 



TCA CAA 
Ser Gin 



GTA AAG 
Val Lys 



GGG CAG GGT TCA 
Gly Gin Gly Ser 
840 



GAC AAG 
Asp Lys 
810 

CCA GAG 
Pro Glu 
825 

GTG CCC 
Val Pro 



GGT CCT 
rly Pro 



GAi\GGG 
Glu \ly 



TGG TCA\ 
Trp Ser 



AAA ACA 
Lys Thr 



CAG CTT 
Gin Leu 
830 

GTA TAT 
'al Tyr 
8^5 



GGA 
Gly 
815 

GTG 
Val 



TCA ATA 
Ser lie 



CAG CTG 
Gin Leu 



GGT GTC TAC 
Gly Val Tyr 



2566 



2614 



2662 



ATC CAG GCT GCT GGG GGC CCC TTG GCA TTC CTG GTTNATT 
lie Gin Ala Ala Gly Gly Pro Leu Ala Phe Leu Val \le 
850 855 860 

TTC ATG CTG AAT GTA GGC AGC ACC GCC TTC AGC ACC TGG' 
Phe Met Leu Asn Val Gly Ser Thr Ala Phe Ser Thr Trp 
870 875 

TAC TGG ATC AAG CAA GGA AGC GGG AAC ACC ACT GTG ACT 
Tyr Trp lie Lys Gin Gly Ser Gly Asn Thr Thr Val Thr 
885 890 

GAG ACC TCG GTG AGT GAC AGC ATG AAG GAC AAT CCT CAT 
Glu Thr Ser Val Ser Asp Ser Met Lys Asp Asn Pro His 
900 905 910 

TAT GCC AGC ATC TAC GCC CTC TCC ATG GCA GTC ATG CTG 
Tyr Ala Ser He Tyr Ala Leu Ser Met Ala Val Met Leu 
915 920 925 



ATG GCC CTT 
Met Ala Leu 
865 

fGG TTG AGT 
Leu Ser 
880 

CGA GGG AAC 
Arg GPy Asn 
895 

ATG CAG TiVC 
Met Gin Tyi 



ATC CTG AAA 
He Leu Lys 



2710 



2758 



2806 



2854

GCC ATT CGA GGA GTT GTC TTT GTC AAG GGC ACG CTG CGA GCT TCC TCC 2950
Ala Ile Arg Gly Val Val Phe Val Lys Gly Thr Leu Arg Ala Ser Ser
930 935 940 945

CGG CTG CA¥ gag GAG CTT TTC CGA AGG ATC CTT CGA AGC CCT ATG AAG 2998
Arg Leu His Asp Glu Leu Phe Arg Arg Ile Leu Arg Ser Pro Met Lys
950 955 960

TTT TTT GAC ACG ACC CCC ACA GGG AGG ATT CTC AAC AGG TTT TCC AAA 3046
Phe Phe Asp Thr Thr Pro Thr Gly Arg Ile Leu Asn Arg Phe Ser Lys
965 970 975

GAC ATG GAT GAA Gl^T GAC GTG CGG CTG CCG TTC CAG GCC GAG ATG TTC 3094
Asp Met Asp Glu Val Asp Val Arg Leu Pro Phe Gln Ala Glu Met Phe
980 985 990

ATC CAG AAC GTT ATC CTG GTG TTC TTC TGT GTG GGA ATG ATC GCA GGA 3142
Ile Gln Asn Val Ile Leu Val Phe Phe Cys Val Gly Met Ile Ala Gly
995 1000 1005

GTC TTC CCG TGG TTC CTT Gl GCA GTG GGG CCC CTT GTC ATC CTC TTT 3190
Val Phe Pro Trp Phe Leu Val Ala Val Gly Pro Leu Val Ile Leu Phe
1010 1015 1020 1025






TCA GTC CTG CAC ATT GTC TCC AGG GTC CTG ATT CGG GAG CTG AAG CGT 3238
Ser Val Leu His Ile Val Ser Arg Val Leu Ile Arg Glu Leu Lys Arg
1030 1035 1040

CTG GAC AAT ATC ACG CAG TCA CCT T^ CTC TCC CAC ATC ACG TCC AGC 3286
Leu Asp Asn Ile Thr Gln Ser Pro Phe Leu Ser His Ile Thr Ser Ser
1045 1050 1055

ATA CAG GGC CTT GCC ACC ATC CAC GCC TAC AAT AAA GGG CAG GAG TTT 3334
Ile Gln Gly Leu Ala Thr Ile His Ala Tyr Asn Lys Gly Gln Glu Phe
1060 1065 1070

CTG CAC AGA TAC CAG GAG CTG CTG GAT GAC A^ CAA GCT CCT TTT TTT 3382
Leu His Arg Tyr Gln Glu Leu Leu Asp Asp Asn Gln Ala Pro Phe Phe
1075 1080 1085

TTG TTT ACG TGT GCG ATG CGG TGG CTG GCT GTG CGG CTG GAC CTC ATC 3430
Leu Phe Thr Cys Ala Met Arg Trp Leu Ala Val Arg Leu Asp Leu Ile
1090 1095 1100 1105

AGC ATC GCC CTC ATC ACC ACC ACG GGG CTG ATG ATC GT^T CTT ATG CAC 3478
Ser Ile Ala Leu Ile Thr Thr Thr Gly Leu Met Ile Val Leu Met His
1110 1115 1120

GGG CAG ATT CCC CCA GCC TAT GCG GGT CTC GCC ATC TCT TA*r GCT GTC 3526
Gly Gln Ile Pro Pro Ala Tyr Ala Gly Leu Ala Ile Ser Tyr Ala Val
1125 1130 1135

CAG TTA ACG GGG CTG TTC CAG TTT ACG GTC AGA CTG GCA TCT Ga\ ACA 3574
Gln Leu Thr Gly Leu Phe Gln Phe Thr Val Arg Leu Ala Ser Glu Thr
1140 1145 1150

GAA GCT CGA TTC ACC TCG GTG GAG AGG ATC AAT CAC TAC ATT AAG ACT 3622
Glu Ala Arg Phe Thr Ser Val Glu Arg Ile Asn His Tyr Ile Lys Thr
1155 1160 1165






CTG TCC TTG GAA GCA CCT GCC AGA ATT AAG AAC AAG GCT CCC TCC CCT 3670
Leu Ser Leu Glu Ala Pro Ala Arg Ile Lys Asn Lys Ala Pro Ser Pro
1170 1175 1180 1185



GAG TGG CCC 
Asp Trp Pro 



GAG GGA GAG GTG ACC TTT GAG AAC GCA GAG ATG AGG 
Gln Glu Gly Glu Val Thr Phe Glu Asn Ala Glu Met Arg
1190 1195 1200



3718 



TAC CGA GAA AAC CTC CCT CTC GTC CTA AAG AAA GTA TCC TTC ACG ATC 3766
Tyr Arg Glu Asn Leu Pro Leu Val Leu Lys Lys Val Ser Phe Thr Ile
1205 1210 1215

AAA CCT AAA GAG AAG ATT GGC ATT GTG GGG CGG ACA GGA TCA GGG AAG 3814
Lys Pro Lys Glu Lys Ile Gly Ile Val Gly Arg Thr Gly Ser Gly Lys
1220 1225 1230

TCC TCG CTG GGG ATG GCC CTC TTC CGT CTG GTG GAG TTA TCT GGA GGC 3862
Ser Ser Leu Gly Met Ala Leu Phe Arg Leu Val Glu Leu Ser Gly Gly
1235 1240 1245

TGC ATC AAG ATT GAT GGA GTG AGA ATC AGT GAT ATT GGC CTT GCC GAC 3910
Cys Ile Lys Ile Asp Gly Val Arg Ile Ser Asp Ile Gly Leu Ala Asp
1250 1255 1260 1265

CTC CGA AGC AAA CTC TCT ATC ATT CCT CAA GAG CCG GTG CTG TTC AGT 3958
Leu Arg Ser Lys Leu Ser Ile Ile Pro Gln Glu Pro Val Leu Phe Ser
1270 1275 1280

GGC ACT GTC AGA TCA AAT TTG GAC CCC TTC AAC CAG TAC ACT GAA GAC 4006
Gly Thr Val Arg Ser Asn Leu Asp Pro Phe Asn Gln Tyr Thr Glu Asp
1285 1290 1295

CAG ATT TGG GAT GCC CTG GAG AGG ACA CAC ATG AAA GAA TGT ATT GCT 4054
Gln Ile Trp Asp Ala Leu Glu Arg Thr His Met Lys Glu Cys Ile Ala
1300 1305 1310

CAG CTA CCT CTG AAA CTT GAA TCT GAA GTG ATG GAG AAT GGG GAT AAC 4102
Gln Leu Pro Leu Lys Leu Glu Ser Glu Val Met Glu Asn Gly Asp Asn
1315 1320 1325

TTC TCA GTG GGG GAA CGG CAG CTC TTG TGC ATA GCT AGA GCOv CTG CTC 4150
Phe Ser Val Gly Glu Arg Gln Leu Leu Cys Ile Ala Arg Ala Leu Leu
1330 1335 1340 1345

CGC CAC TGT AAG ATT CTG ATT TTA GAT GAA GCC ACA GCT GCC ATG GAC 4198
Arg His Cys Lys Ile Leu Ile Leu Asp Glu Ala Thr Ala Ala Met Asp
1350 1355 1360

ACA GAG ACA GAC TTA TTG ATT CAA GAG ACC ATC CGA GAA GCA TTT GCA 4246
Thr Glu Thr Asp Leu Leu Ile Gln Glu Thr Ile Arg Glu Ala Phe Ala
1365 1370 1375

GAC TGT ACC ATG CTG ACC ATT GCC CAT CGC CTG CAC ACG GTT CTA GGC 4294
Asp Cys Thr Met Leu Thr Ile Ala His Arg Leu His Thr Val Leu Gly
1380 1385 1390

TCC GAT AGG ATT ATG GTG CTG GCC CAG GGA CAG GTG GTG GAG TTT GAC 4342
Ser Asp Arg Ile Met Val Leu Ala Gln Gly Gln Val Val Glu Phe Asp
1395 1400 1405

ACC CCA TCG GTC CTT CTG TCC AAC GAC AGT TCC CGA TTC TAT GCC ATG 4390
Thr Pro Ser Val Leu Leu Ser Asn Asp Ser Ser Arg Phe Tyr Ala Met
1410 1415 1420 1425

TTT GCT GC^GCA GAG AAC AAG GTC GCT GTC AAG GGC TGACTCCTCC 4436
Phe Ala Ala Ala Glu Asn Lys Val Ala Val Lys Gly
1430 1435
1430 1435 



CTGTTGACGA AGTOJCTTTT CTTTAGAGCA TTGCCATTCC CTGCCTGGGG CGGGCCCCTT 4496
CATCGCGTCC TCCTAOCGAA ACCTTGCCTT TCTCGATTTT ATCTTTCGCA CAGCAGTTCC 4556
GGATTGGCTT GTGTGTTTCA CTTTTAGGGA GAGTCATATT TTGATTATTG TATTTATTCC 4616
ATATTCATGT AAAC/UU^TTV. TAGTTTTTGT TCTTAATTGC ACTCTAAAAG GTTCAGGGAA 4676
CCGTTATTAT AATTGTATCA C3AGGCCTATA ATGAAGCTTT ATACGTGTAG CTATATCTAT 4736
ATATAATTCT GTACATAGCC TATSt^TTTACA GTGAAAATGT AAGCTGTTTA TTTTATATTA 4796
AAATAAGCAC TGTGCTAAAA AAAAAJU^AAA AAAAAAAAAA AGGGCGGCCG C 4847




(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1437 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Lys Asp Ile Asp Ile Gly Lys Glu Tyr Ile Ile Pro Ser Pro Gly
1 5 10 15

Tyr Arg Ser Val Arg Glu Arg Thr Ser Thr Ser Gly Thr His Arg Asp
20 25 30

Arg Glu Asp Ser Lys Phe Arg Arg Thr Arg Pro Leu Glu Cys Gln Asp
35 40 45

Ala Leu Glu Thr Ala Ala Arg Ala Glu Gly Leu Ser Leu Asp Ala Ser
50 55 60

Met His Ser Gln Leu Arg Ile Leu Asp Glu Glu His Pro Lys Gly Lys
65 70 75 80

Tyr His His Gly Leu Ser Ala Leu Lys Pro Ile Arg Thr Thr Cys Lys
85 90 95

His Gln His Pro Val Asp Asn Ala Gly Leu Phe Ser Cys Met Thr Phe
100 105 110

Ser Trp Leu Ser Ser Leu Ala Arg Val Ala His Lys Lys Gly Glu Leu
115 120 125

Ser Met Glu Asp Val Trp Ser Leu Ser Lys His Glu Ser Ser Asp Val
130 135 140






Asn Cys Arg Arg Leu Glu Arg Leu Trp Gln Glu Glu Leu Asn Glu Val
145 150 155 160

Gly Pro Asp Ala Ala Ser Leu Arg Arg Val Val Trp Ile Phe Cys Arg
165 170 175

Thr Arg Leu Ile Leu Ser Ile Val Cys Leu Met Ile Thr Gln Leu Ala
180 185 190

Gly Phe Ser Gly Pro Ala Phe Met Val Lys His Leu Leu Glu Tyr Thr
195 200 205

Gln Ala Thr Glu Ser Asn Leu Gln Tyr Ser Leu Leu Leu Val Leu Gly
210 215 220

Leu Leu Leu Thr Glu Ile Val Arg Ser Trp Ser Leu Ala Leu Thr Trp
225 230 235 240

Ala Leu Asn Tyr Arg Thr Gly Val Arg Leu Arg Gly Ala Ile Leu Thr
245 250 255

Met Ala Phe Lys Lys Ile Leu Lys Leu Lys Asn Ile Lys Glu Lys Ser
260 265 270

Leu Gly Glu Leu Ile Asn Ile Cys Ser Asn Asp Gly Gln Arg Met Phe
275 280 285

Glu Ala Ala Ala Val Gly Ser Leu Leu Ala Gly Gly Pro Val Val Ala
290 295 300

Ile Leu Gly Met Ile Tyr Asn Val Ile Ile Leu Gly Pro Thr Gly Phe
305 310 315 320

Leu Gly Ser Ala Val Phe Ile Leu Phe Tyr Pro Ala Met Met Phe Ala
325 330 335

Ser Arg Leu Thr Ala Tyr Phe Arg Arg Lys Cys Val Ala Ala Thr Asp
340 345 350

Glu Arg Val Gln Lys Met Asn Glu Val Leu Thr Tyr Ile Lys Phe Ile
355 360 365

Lys Met Tyr Ala Trp Val Lys Ala Phe Ser Gln Ser Val Gln Lys Ile
370 375 380

Arg Glu Glu Glu Arg Arg Ile Leu Glu Lys Ala Gly Tyr Phe Gln Ser
385 390 395 400

Ile Thr Val Gly Val Ala Pro Ile Val Val Val Ile Ala Ser Val Val
405 410 415

Thr Phe Ser Val His Met Thr Leu Gly Phe Asp Leu Thr Ala Ala Gln
420 425 430

Ala Phe Thr Val Val Thr Val Phe Asn Ser Met Thr Phe Ala Leu Lys
435 440 445



Val Thr Pro Phe Ser Val Lys Ser Leu Ser Glu Ala Ser Val Ala Val
450 455 460

Asp Arg Phe Lys Ser Leu Phe Leu Met Glu Glu Val His Met Ile Lys
465 470 475 480

Asn Lys Pro Ala Ser Pro His Ile Lys Ile Glu Met Lys Asn Ala Thr
485 490 495

Leu Ala Trp Asp Ser Ser His Ser Ser Ile Gln Asn Ser Pro Lys Leu
500 505 510

Thr Pro Lys Met Lys Lys Asp Lys Arg Ala Ser Arg Gly Lys Lys Glu
515 520 525

Lys Val Arg Gln Leu Gln Arg Thr Glu His Gln Ala Val Leu Ala Glu
530 535 540

Gln Lys Gly His Leu Leu Leu Asp Ser Asp Glu Arg Pro Ser Pro Glu
545 550 555 560

Glu Glu Glu Gly Lys His Ile His Leu Gly His Leu Arg Leu Gln Arg
565 570 575

Thr Leu His Ser Ile Asp Leu Glu Ile Gln Glu Gly Lys Leu Val Gly
580 585 590

Ile Cys Gly Ser Val Gly Ser Gly Lys Thr Ser Leu Ile Ser Ala Ile
595 600 605

Leu Gly Gln Met Thr Leu Leu Glu Gly Ser Ile Ala Ile Ser Gly Thr
610 615 620

Phe Ala Tyr Val Ala Gln Gln Ala Trp Ile Leu Asn Ala Thr Leu Arg
625 630 635 640

Asp Asn Ile Leu Phe Gly Lys Glu Tyr Asp Glu Glu Arg Tyr Asn Ser
645 650 655

Val Leu Asn Ser Cys Cys Leu Arg Pro Asp Leu Ala Ile Leu Pro Ser
660 665 670

Ser Asp Leu Thr Glu Ile Gly Glu Arg Gly Ala Asn Leu Ser Gly Gly
675 680 685

Gln Arg Gln Arg Ile Ser Leu Ala Arg Ala Leu Tyr Ser Asp Arg Ser
690 695 700

Ile Tyr Ile Leu Asp Asp Pro Leu Ser Ala Leu Asp Ala His Val Gly
705 710 715 720

Asn His Ile Phe Asn Ser Ala Ile Arg Lys His Leu Lys Ser Lys Thr
725 730 735

Val Leu Phe Val Thr His Gln Leu Gln Tyr Leu Val Asp Cys Asp Glu
740 745 750

Val Ile Phe Met Lys Glu Gly Cys Ile Thr Glu Arg Gly Thr His Glu
755 760 765

Glu Leu Met Asn Leu Asn Gly Asp Tyr Ala Thr Ile Phe Asn Asn Leu
770 775 780

Leu Leu Gly Glu Thr Pro Pro Val Glu Ile Asn Ser Lys Lys Glu Thr
785 790 795 800

Ser Gly Ser Gln Lys Lys Ser Gln Asp Lys Gly Pro Lys Thr Gly Ser
805 810 815

Ile Lys Lys Glu Lys Ala Val Lys Pro Glu Glu Gly Gln Leu Val Gln
820 825 830

Leu Glu Glu Lys Gly Gln Gly Ser Val Pro Trp Ser Val Tyr Gly Val
835 840 845

Tyr Ile Gln Ala Ala Gly Gly Pro Leu Ala Phe Leu Val Ile Met Ala
850 855 860

Leu Phe Met Leu Asn Val Gly Ser Thr Ala Phe Ser Thr Trp Trp Leu
865 870 875 880

Ser Tyr Trp Ile Lys Gln Gly Ser Gly Asn Thr Thr Val Thr Arg Gly
885 890 895

Asn Glu Thr Ser Val Ser Asp Ser Met Lys Asp Asn Pro His Met Gln
900 905 910

Tyr Tyr Ala Ser Ile Tyr Ala Leu Ser Met Ala Val Met Leu Ile Leu
915 920 925

Lys Ala Ile Arg Gly Val Val Phe Val Lys Gly Thr Leu Arg Ala Ser
930 935 940

Ser Arg Leu His Asp Glu Leu Phe Arg Arg Ile Leu Arg Ser Pro Met
945 950 955 960

Lys Phe Phe Asp Thr Thr Pro Thr Gly Arg Ile Leu Asn Arg Phe Ser
965 970 975

Lys Asp Met Asp Glu Val Asp Val Arg Leu Pro Phe Gln Ala Glu Met
980 985 990

Phe Ile Gln Asn Val Ile Leu Val Phe Phe Cys Val Gly Met Ile Ala
995 1000 1005

Gly Val Phe Pro Trp Phe Leu Val Ala Val Gly Pro Leu Val Ile Leu
1010 1015 1020

Phe Ser Val Leu His Ile Val Ser Arg Val Leu Ile Arg Glu Leu Lys
1025 1030 1035 1040

Arg Leu Asp Asn Ile Thr Gln Ser Pro Phe Leu Ser His Ile Thr Ser
1045 1050 1055

Ser Ile Gln Gly Leu Ala Thr Ile His Ala Tyr Asn Lys Gly Gln Glu
1060 1065 1070

Phe Leu His Arg Tyr Gln Glu Leu Leu Asp Asp Asn Gln Ala Pro Phe
1075 1080 1085

Phe Leu Phe Thr Cys Ala Met Arg Trp Leu Ala Val Arg Leu Asp Leu
1090 1095 1100

Ile Ser Ile Ala Leu Ile Thr Thr Thr Gly Leu Met Ile Val Leu Met
1105 1110 1115 1120

His Gly Gln Ile Pro Pro Ala Tyr Ala Gly Leu Ala Ile Ser Tyr Ala
1125 1130 1135

Val Gln Leu Thr Gly Leu Phe Gln Phe Thr Val Arg Leu Ala Ser Glu
1140 1145 1150

Thr Glu Ala Arg Phe Thr Ser Val Glu Arg Ile Asn His Tyr Ile Lys
1155 1160 1165

Thr Leu Ser Leu Glu Ala Pro Ala Arg Ile Lys Asn Lys Ala Pro Ser
1170 1175 1180

Pro Asp Trp Pro Gln Glu Gly Glu Val Thr Phe Glu Asn Ala Glu Met
1185 1190 1195 1200

Arg Tyr Arg Glu Asn Leu Pro Leu Val Leu Lys Lys Val Ser Phe Thr
1205 1210 1215

Ile Lys Pro Lys Glu Lys Ile Gly Ile Val Gly Arg Thr Gly Ser Gly
1220 1225 1230

Lys Ser Ser Leu Gly Met Ala Leu Phe Arg Leu Val Glu Leu Ser Gly
1235 1240 1245

Gly Cys Ile Lys Ile Asp Gly Val Arg Ile Ser Asp Ile Gly Leu Ala
1250 1255 1260

Asp Leu Arg Ser Lys Leu Ser Ile Ile Pro Gln Glu Pro Val Leu Phe
1265 1270 1275 1280

Ser Gly Thr Val Arg Ser Asn Leu Asp Pro Phe Asn Gln Tyr Thr Glu
1285 1290 1295

Asp Gln Ile Trp Asp Ala Leu Glu Arg Thr His Met Lys Glu Cys Ile
1300 1305 1310

Ala Gln Leu Pro Leu Lys Leu Glu Ser Glu Val Met Glu Asn Gly Asp
1315 1320 1325

Asn Phe Ser Val Gly Glu Arg Gln Leu Leu Cys Ile Ala Arg Ala Leu
1330 1335 1340

Leu Arg His Cys Lys Ile Leu Ile Leu Asp Glu Ala Thr Ala Ala Met
1345 1350 1355 1360

Asp Thr Glu Thr Asp Leu Leu Ile Gln Glu Thr Ile Arg Glu Ala Phe
1365 1370 1375

Ala Asp Cys Thr Met Leu Thr Ile Ala His Arg Leu His Thr Val Leu
1380 1385 1390

Gly Ser Asp Arg Ile Met Val Leu Ala Gln Gly Gln Val Val Glu Phe
1395 1400 1405

Asp Thr Pro Ser Val Leu Leu Ser Asn Asp Ser Ser Arg Phe Tyr Ala
1410 1415 1420

Met Phe Ala Ala Ala Glu Asn Lys Val Ala Val Lys Gly
1425 1430 1435

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 463 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
GCGTCCGCCT AGAACGCAGA GATGAGGTAC CGAGAAAACC TCCCTCTCGT CCTAAAGAAA
GTATCCTTCA CGATCAAACC TAAAGAGAAG ATTGGCATTG TGGGGCGGAC AGGATCAGGG
AAGTCCTCGC TGGGGATGGC CCTCTTCCGT CTGGTGGAGT TATCTGGAGG CTGCATCAAG
ATTGATGGAG TGAGAATCAG TGATATTGGC CTTGCCGACC TCCGAAGCAA ACTCTCTATC
ATTCCTCAAG AGCCGGTGCT GTTCAGTGGC ACTGTCAGAT CAAATTTGGA CCCTTCAACC
AGTACACTGA AGACCAGATT TGGGATGCCC TGGAAA^AC ACACATGAAA GAATGTATTG 
CTCCAGCTAC CTCCTGAAAC TTGAATCCTG AATTTGAT^ AGAAATGGGG AAATAACTTC 
TGCAGTTGGG GGAAACGGCA CTCTTTGTTG CCATACCTAn\^CC 
(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



60
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
TGCTGGTTCT CTCCCTCACA CTTC 
(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



24
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
CTCGGCTCGG GCTGCTGTTT CCAA
(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
GGGTGCTGGT GTTTGGAAGT AGTC
(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

24

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
GCCAGAGAAG AAAGCCACGA AAAA 
(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
AGGCACACGA TGGACAGGAT GAGC 
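
The 24-base oligonucleotides above can be compared with the coding strand of SEQ ID NO:1 by taking their reverse complement. The sketch below is an illustration under stated assumptions, not a statement of how the sequences were designed: the function name `reverse_complement` is hypothetical, the stray period in the scanned SEQ ID NO:5 line is assumed to be a scanning artifact, and the comparison string is the codon run printed for residues 50 to 57 of SEQ ID NO:1 (Leu Glu Thr Ala Ala Arg Ala Glu).

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    Hypothetical helper for illustration; not part of the listing.
    """
    cleaned = "".join(seq.split()).upper()
    return cleaned.translate(COMPLEMENT)[::-1]


# SEQ ID NO:5 as printed, with the stray period assumed to be scan debris.
seq_id_5 = "CTCGGCTCGG GCTGCTGTTT CCAA"

# Codons printed in SEQ ID NO:1 for residues 50-57.
coding_50_57 = "TTG GAA ACA GCA GCC CGA GCC GAG"

print(reverse_complement(seq_id_5))  # TTGGAAACAGCAGCCCGAGCCGAG
assert reverse_complement(seq_id_5) == "".join(coding_50_57.split())
```

Under these assumptions SEQ ID NO:5 reads as the antisense strand of that stretch of SEQ ID NO:1. The same helper can be applied to the other 24-base sequences, although the listing itself does not state their orientation or purpose.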